On The Marriage of Lp-norms and Edit Distance
نویسندگان
چکیده
Existing studies on time series are based on two categories of distance functions. The first category consists of the Lp-norms. They are metric distance functions but cannot support local time shifting. The second category consists of distance functions which are capable of handling local time shifting but are nonmetric. The first contribution of this paper is the proposal of a new distance function, which we call ERP (“Edit distance with Real Penalty”). Representing a marriage of L1norm and the edit distance, ERP can support local time shifting, and is a metric. The second contribution of the paper is the development of pruning strategies for large time series databases. Given that ERP is a metric, one way to prune is to apply the triangle inequality. Another way to prune is to develop a lower bound on the ERP distance. We propose such a lower bound, which has the nice computational property that it can be efficiently indexed with a standard B+tree. Moreover, we show that these two ways of pruning can be used simultaneously for ERP distances. Specifically, the false positives obtained from the B+-tree can be further minimized by applying the triangle inequality. Based on extensive experimentation with existing benchmarks and techniques, we show that this combination delivers superb pruning power and search time performance, and dominates all existing strategies. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment. Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004
منابع مشابه
LP problems constrained with D-FRIs
In this paper, optimization of a linear objective function with fuzzy relational inequality constraints is investigated where the feasible region is formed as the intersection of two inequality fuzzy systems and Dombi family of t-norms is considered as fuzzy composition. Dombi family of t-norms includes a parametric family of continuous strict t-norms, whose members are increasing functions of ...
متن کاملStudying the cohabitation subculture in metropolis Tehran (backgrounds and consequents)
Expended Abstract Introduction: In recent years, new methods of relationship between both sexes have emerged which have no family framework. Today, some girls and boys are living together although they have not got married legally and formally; based on a mutual agreement for an unspecified time. Some scholars express that Iran’s society has faced with increasing waves of changes in values an...
متن کاملStudying the cohabitation subculture in metropolis Tehran (backgrounds and consequents)
Expended Abstract Introduction: In recent years, new methods of relationship between both sexes have emerged which have no family framework. Today, some girls and boys are living together although they have not got married legally and formally; based on a mutual agreement for an unspecified time. Some scholars express that Iran’s society has faced with increasing waves of changes in values an...
متن کاملA Study on Social Structure, Marriage and a Romantic Relation
For marriage, we need at least a positive tendency but a forced or expedient marriage leads to divorce. Living in a society full of loving and respecting each other is necessary and it is an outcome of an organized family. This paper tries to analyze the effect of social structure on establishment of sustainable relation and emotional relationship. We can examine this subject from two point ...
متن کاملFast Time Sequence Indexing for Arbitrary Lp Norms
Fast indexing in time sequence databases for similarity searching has attracted a lot of research recently. Most of the proposals, however, typically centered around the Euclidean distance and its derivatives. We examine the problem of multimodal similarity search in which users can choose the best one from multiple similarity models for their needs. In this paper, we present a novel and fast i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004